STIC Biotechnology Systems Branch 



RAW SEQUENCE LISTING 
ERROR REPORT 



The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 

Source: [ /F^n ~ 

Date Processed by STIC: 

THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION AND PATENTIN SOFTWARE QUESTIONS, PLEASE CONTACT 
MARK SPENCER, TELEPHONE: 571-272-2510; FAX: 571-273-0221 



TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 4.4.0 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
http://www.uspto.gov/web/offices/pac/checker/chkrnote.htm 



Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there is 

a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail. 

Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom. 

Any reply including a sequence listing in electronic form should NOT be sent to the 2023 1 zip code address for the 

United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses: 

1. EFS-Bio (<http://www.uspto.gov/ebc/efs/dovvnloads/documents.htm> , EFS Submission 

User Manual - ePAVE) 

2. U.S. Postal Service: Commissioner for Patents, P.O. Box 1450, Alexandria, VA 22313-1450 

3. Hand Carry, Federal Express, United Parcel Service, or other delivery service (EFFECTIVE 01/14/05): 
U.S. Patent and Trademark Office, Mail Stop Sequence, Customer Window, Randolph Building, 401 Dulany Street, 
Alexandria, VA 22314 

Revised 01/10/06 



Raw Sequence Listing Error Summary 



ERROR DETECTED SUGGESTED CORRECTION SERIAL NUMBER: 

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 



I: 



1 



10 



11 



12 



^Wrapped Nucleics The number/text at the end of each line "wrapped" down to the next line. This may occur if your file 
Wrapped Aminos was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will 
prevent "wrapping." 

Jnvalid Line Length The rules require that a line not exceed 72 characters in length. This includes white spaces. 



_Misaligned Amino The numbering under each 5 th amino acid is misaligned. Do not use tab codes between numbers; 
Numbering use space characters, instead. 

_Non-ASCII The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please 

ensure your subsequent submission is saved in ASCII text. 

_Variable Length Sequence(s) contain n's or Xaa's representing more than one residue. Per Sequence Rules, 



_PatentIn 2.0 
"bug" 



Skipped Sequences 
(OLD RULES) 



each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

A "bug" in Patentln version 2.0 has caused the <220>-<223> section to be missing from amino acid 

sequences(s) . Normally, Patentln would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 

Sequence(s) missing. If intentional, please insert the following lines for each skipped sequence: 

(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 
(xi) SEQUENCE DESCRIPTION:SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
This sequence is intentionally skipped 
Please also adjust the "(ii) NUMBER OF SEQUENCES:' 



JJse of n's or Xaa's 
(NEW RULES) 



Jnvalid <213> 
Response 



Use of<220> 



i 



_PatentIn 2.0 
"bug" 



response to include the skipped sequences. 



Skipped Sequences 
(NEW RULES) 



Sequence(s) 



missing. If intentional, please insert the following lines for each skipped sequence. 



<210> sequence id number 
<400> sequence id number 
000 

Use of n's and/or Xaa's have been detected in the Sequence Listing. 

Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or 
scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or 
is Artificial Sequence, (see item 1 1 below) 



Se quence^ min in g the <2?Q> "Fran i nr" an il assoi in ii n'ri n ii^ntifirrr . and rp ispon g<>g 
f<220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or "Unknown." 
Please explain source of genetic material in <220> to <223> section or use "chemically synthesized" as 
explanation. ( See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32), also Sec. 1.823 of 
Sequence Rules 



Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 



13 



Misuse of n/Xaa "n" can only represent a single nucleotide: "Xaa" can only represent a single amino acid 
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RAW SEQUENCE LISTING DATE: 09/26/2006 

PATENT APPLICATION: US/10/563 , 826 TIME: 10:24:13 

Input Set : A:\2006-07-10 0365-0662PUSl.txt 
Output Set: N:\CRF4\09262006\J563826.raw 



5 <110> APPLICANT: LINDER, Markus et al . 

7 <120> TITLE OF INVENTION: A METHOD FOR CLEAVING PROTEINS 

9 <130> FILE REFERENCE: 0365 -0662PUS1 

11 <140> CURRENT APPLICATION NUMBER: 

12 <141> CURRENT FILING DATE: 2006-01-06 

14 <150> PRIOR APPLICATION NUMBER: PCT/FI04/00439 

15 <151> PRIOR FILING DATE: 2004-07-08 

17 <150> PRIOR APPLICATION NUMBER: 2001050 'Z^Cq 



10/563,826 S^AA-'f^p- 



18 <151> PRIOR FILING DATE: 2003-07-09 
20 <160> NUMBER OF SEQ ID NOS : 30 



22 <170> SOFTWARE: Patent In version 3.1 

24 <210> SEQ ID NO: 1 , ^ . i I . . Z Jh "^Bf 

25 <211> LENGTH: 22 
2 6 <212> TYPE: PRT 
2 7 <213> ORGANISM: 
2 9 <220> FEATURE: 
30 



Artificial Sequence * ' D[Ajjt~ /ti/fl *^ 1 *^—^ 

<220> FEATURE: "*\ /[ A " ~j ^ 

<223> OTHER INFORMATION: amino acid linker sequence fromfFig. 2 J " Cn G JLaJIAJ l< --' 

32 <400> SEQUENCE: 1 J s f 

33 Gly Ser Pro Thr Gly Ala Ser Thr His His His His His His Gly Ser ' 0 1 A 

34 1 5 10 15 

36 Pro Thr Gly Ala Ser Thr 

37 20 



40 <210> SEQ ID NO: 2 

41 <211> LENGTH: 22 

42 <212> TYPE: PRT 

43 <213> ORGANISM: Artificial Sequence 

45 <220> FEATURE: //^^S 

46 <223> OTHER INFORMATION: amino acid sequence fron/ Fig. 3 ) 

48 <400> SEQUENCE: 2 \ ^ \ y 

49 Gly Ser Pro Thr Gly Ala Ser Thr Gly Gly Gly Gly Gly Gly Gly Ser 

50 1 5 10 15 

53 Pro Thr Gly Ala Ser Thr 

54 20 

57 <210> SEQ ID NO: 3 

58 <211> LENGTH: 22 

59 <212> TYPE: PRT 

60 <213> ORGANISM: Artificial Sequence 

62 <220> FEATURE: 

63 <223> OTHER INFORMATION: amino acid sequence frorl 

65 <400> SEQUENCE: 3 

66 Gly Ser Pro Thr Gly Ala Ser Thr His His His His His His Gly Ser 

67 1 5 10 
70 Pro Thr Gly Ala Ser Thr 




file://C: \CRF4\Outhold\VsrJ563826.htm 
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RAW SEQUENCE LISTING DATE: 09/26/2006 

PATENT APPLICATION: US/10/563,826 TIME: 10:24:13 

Input Set : A:\2006-07-10 0365-0662PUSl.txt 
Output Set: N:\CRP4\09262006\J563826.raw 

71 20 

74 <210> SEQ ID NO: 4 

75 <211> LENGTH: 22 

76 <212> TYPE: PRT 

77 <213> ORGANISM: Artificial Sequence 

79 <220> FEATURE: 

80 <223> OTHER INFORMATION: amino acid sequence from 

82 <400> SEQUENCE: 4 

83 Gly Ser Pro Thr Gly Ala Ser Thr Gly Ser Thr Gly Pro Ser Gly Ser 

84 1 5 10 15 

87 Pro Thr Gly Ala Ser Thr 

88 20 

91 <210> SEQ ID NO: 5 

92 <211> LENGTH: 20 

93 <212> TYPE: PRT 

94 <213> ORGANISM: Artificial Sequence 

96 <220> FEATURE: 

97 <223> OTHER INFORMATION: amino acid sequence from 

99 <400> SEQUENCE: 5 

100 Gly Ser Pro Thr Gly Ala Ser Thr His His His His Gly Ser Pro Thr 

101 15 10 15 

104 Gly Ala Ser Thr 

105 20 

108 <210> SEQ ID NO: 6 

109 <211> LENGTH: 18 

110 <212> TYPE: PRT 

111 <213> ORGANISM: Artificial Sequence 

113 <220> FEATURE: 

114 <223> OTHER INFORMATION: amino acid sequence from 

116 <400> SEQUENCE: 6 

117 Gly Ser Pro Thr Gly Ala Ser Thr His His Gly Ser Pro Thr Gly Ala 

118 15 10 15 
121 Ser Thr 

125 <210> SEQ ID NO: 7 

126 <211> LENGTH: 24 
12 7 <212> TYPE: PRT 

128 <213> ORGANISM: Artificial Sequence 

130 <220> FEATURE: 

131 <223> OTHER INFORMATION: amino acid sequence from 

133 <400> SEQUENCE: 7 

134 Gly Ser Pro Thr Gly Ala Ser Thr His His His His His His His His 

135 15 10 15 

138 Gly Ser Pro Thr Gly Ala Ser Thr 

139 20 

142 <210> SEQ ID NO: 8 

143 <211> LENGTH: 27 

144 <212> TYPE: PRT 

145 <213> ORGANISM: Artificial Sequence 
147 <220> FEATURE: 
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RAW SEQUENCE LISTING DATE: 09/26/2006 

PATENT APPLICATION: US/10/563 , 826 TIME: 10:24:13 

Input Set : A:\2006-07-10 0365-0662PUSl.txt 
Output Set: N:\CRP4\09262006\J563826. raw 




148 <223> OTHER INFORMATION: amino acid sequence 

150 <400> SEQUENCE: 8 

151 Gly Ser Pro Thr Gly Ala Ser Thr His Ser His Ala His Gly His Ala 

152 1 5 10 15 

155 His Ser His Gly Ser Pro Thr Gly Ala Ser Thr 

156 20 25 

159 <210> SEQ ID NO: 9 

160 <211> LENGTH: 12 

161 <212> TYPE: PRT 

162 <213> ORGANISM: Artificial Sequence 

164 <220> FEATURE: 

165 <223> OTHER INFORMATION: amino acid sequence referred to 

167 <400> SEQUENCE: 9 

168 His Ser His Ala His Gly His Ala His Ser His Gly ^ 

172 <210> SEQ ID NO: 10 

173 <211> LENGTH: 40 a ~\ *2 (J ^COl/ 

174 <212> TYPE: DNA A^A^^^ 




175 <213> ORGANISM: Artificial sequence 

177 <220> FEATURE: 

178 <223> OTHER INFORMATION: oligonucleotide used to PCR amplify the DNA fragment 

179 encoding ABP 

181 <400> SEQUENCE: 10 

182 gcattggatt cgaattctta gctgaagcta aagtcttagc 40 

185 <210> SEQ ID NO: 11 

186 <211> LENGTH: 34 

187 <212> TYPE: DNA 

188 <213> ORGANISM: Artificial sequence 

190 <220> FEATURE: 

191 <223> OTHER INFORMATION: oligonucleotide used to PCR amplify the DNA fragment 

192 encoding ABP 

194 <400> SEQUENCE: 11 

195 gcattaagct tctattcgct ttttgccgga gtag 34 

198 <210> SEQ ID NO: 12 

199 <211> LENGTH: 69 

200 <212> TYPE: DNA 

201 <213> ORGANISM: Artificial sequence 

203 <220> FEATURE: 

204 <223> OTHER INFORMATION: oligonucleotide used to generate pLink2 

206 <400> SEQUENCE: 12 

207 cgggtagccc aaccggcgcg agcacccatc accatcacca tcacggtagc ccaaccggcg 60 
209 cgagcaccg 69 

212 <210> SEQ ID NO: 13 

213 <211> LENGTH: 77 

214 <212> TYPE: DNA 

215 <213> ORGANISM: Artificial sequence 

217 <220> FEATURE: 

218 <223> OTHER INFORMATION: oligonucleotide used to generate pLink2 
220 <400> SEQUENCE: 13 
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RAW SEQUENCE LISTING DATE: 09/26/2006 

PATENT APPLICATION: US/10/563 , 826 TIME: 10:24:13 

Input Set : A:\2006-07-10 0365-0662PUSl.txt 
Output Set: N:\CRF4\09262006\J563826.raw 

221 aattcggtgc tcgcgccggt tgggctaccg tgatggtgat ggtgatgggt gctcgcgccg 60 
223 gttgggctac ccgagct 77 

226 <210> SEQ ID NO: 14 

227 <211> LENGTH: 69 

228 <212> TYPE: DNA 

229 <213> ORGANISM: Artificial sequence 

231 <220> FEATURE: 

232 <223> OTHER INFORMATION: oligonucleotide used to generate pLink3 

234 <400> SEQUENCE: 14 

235 cgggtagccc aaccggcgcg agcaccggcg gtggtggtgg cggcggtagc ccaaccggcg 60 

23 7 cgagcaccg 69 

240 <210> SEQ ID NO: 15 

241 <211> LENGTH: 77 

242 <212> TYPE: DNA 

243 <213> ORGANISM: Artificial sequence 

245 <220> FEATURE: 

246 <223> OTHER INFORMATION: oligonucleotide used to generate pLink3 
248 <400> SEQUENCE: 15 

24 9 aattcggtgc tcgcgccggt tgggctaccg ccgccaccac cagggccggt gctcgcgccg 60 
251 gttgggctac ccgagct 77 

254 <210> SEQ ID NO: 16 

255 <211> LENGTH: 33 

256 <212> TYPE: DNA 

257 <213> ORGANISM: Artificial sequence 

259 <220> FEATURE: 

260 <223> OTHER INFORMATION: oligonucleotide used to generate pLink6 

262 <400> SEQUENCE: 16 

263 gcattgaatt cgacccctcc aaggactcga agg 33 

266 <210> SEQ ID NO: 17 

267 <211> LENGTH: 33 

268 <212> TYPE: DNA 

269 <213> ORGANISM: Artificial sequence 

271 <220> FEATURE: 

272 <223> OTHER INFORMATION: oligonucleotide used to generate pLink6 

274 <400> SEQUENCE: 17 

275 gcattaagct tctactgctg aacggcgtcg age 33 

278 <210> SEQ ID NO: 18 

279 <211> LENGTH: 69 

280 <212> TYPE: DNA 

281 <213> ORGANISM: Artificial sequence 
2 83 <22 0> FEATURE: 

284 <223> OTHER INFORMATION: oligonucleotide used to generate pLink7 

286 <400> SEQUENCE: 18 

287 cgggtagccc aaccggcgcg agcaccggca gcaccggtcc aageggtage ccaaccggcg 60 
28 9 cgagcaccg 69 

292 <210> SEQ ID NO: 19 

293 <211> LENGTH: 77 
2 94 <212> TYPE: DNA 

295 <213> ORGANISM: Artificial sequence 
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RAW SEQUENCE LISTING DATE : 09/26/2006 

PATENT APPLICATION: US/10/563 , 82 6 TIME: 10:24:13 



Input Set : A:\2006-07-10 0365-0662PUSl.txt 
Output Set: N:\CRF4\09262006\J563826.raw 

297 <220> FEATURE: 

298 <223> OTHER INFORMATION: oligonucleotide used to generate pLink7 

300 <400> SEQUENCE: 19 

301 aattcggtgc tcgcgccggt tgggctaccg cttggaccgg tgctgccggt gctcgcgccg 
303 gttgggctac ccgagct 

306 <210> SEQ ID NO: 20 

307 <211> LENGTH: 63 

308 <212> TYPE: DNA 

309 <213> ORGANISM: Artificial sequence 

311 <220> FEATURE: 

312 <223> OTHER INFORMATION: oligonucleotide used to generate pLink8 

314 <400> SEQUENCE: 20 

315 cgggtagccc aaccggcgcg agcacccatc accatcacgg tagcccaacc ggcgcgagca 
317 ccg 

320 <210> SEQ ID NO: 21 

321 <211> LENGTH: 67 

322 <212> TYPE: DNA 

323 <213> ORGANISM: Artificial sequence 

325 <220> FEATURE: 

326 <223> OTHER INFORMATION: oligonucleotide used to generate pLink8 
328 <400> SEQUENCE: 21 

32 9 aattcggtgc tcgcgccggt tgggctaccg tgatggtgat gggtgctcgc gccggttggg 
331 ctacccg 

334 <210> SEQ ID NO: 22 

335 <211> LENGTH: 56 

336 <212> TYPE: DNA 

337 <213> ORGANISM: Artificial sequence 

339 <220> FEATURE: 

340 <223> OTHER INFORMATION: oligonucleotide used to generate pLinklO 

342 <400> SEQUENCE: 22 

343 cgggtagccc aaccggcgcg agcacccatc acggtagccc aaccggcgcg agcacc 

346 <210> SEQ ID NO: 23 

347 <211> LENGTH: 65 

348 <212> TYPE: DNA 

349 <213> ORGANISM: Artificial sequence 

351 <220> FEATURE: 

352 <223> OTHER INFORMATION: oligonucleotide used to generate pLinklO 

354 <400> SEQUENCE: 23 

355 aattcggtgc tcgcgccggt tgggctaccg tgatgggtgc tcgcgccggt tgggctaccc 
357 gagct 

360 <210> SEQ ID NO: 24 

361 <211> LENGTH: 75 

362 <212> TYPE: DNA 

363 <213> ORGANISM: Artificial sequence 

365 <22 0> FEATURE: 

366 <223> OTHER INFORMATION: oligonucleotide used to generate pLinkl2 

368 <400> SEQUENCE: 24 

369 cgggtagccc aaccggcgcg agcacccacc atcaccatca ccatcaccat ggtagcccaa 
3 71 ccggcgcgag caccg 
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VERIFICATION SUMMARY DATE: 09/26/2006 

PATENT APPLICATION: US/10/563,826 TIME: 10:24:14 

Input Set : A:\2006-07-10 0365-0662PUSl.txt 
Output Set: N:\CRF4\09262006\J563826.raw 
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